Text classification using document-document semantic similarity
Similar resources
Semantic similarity based web document classification using support vector machine
With the rapid growth of information on the World Wide Web (WWW), classification of web documents has become important for efficient information retrieval. The relevance of retrieved information can also be improved by considering semantic relatedness between words, which is a basic research area in the fields of natural language processing, intelligent retrieval, document clustering and classification,...
Document Classification Using Semantic Networks with An Adaptive Similarity Measure
We consider supervised document classification where a semantic network is used to augment document features with their hypernyms. A novel document representation is introduced in which the contribution of the hypernyms to document similarity is determined by semantic network edge weights. We argue that the optimal edge weights are not a static property of the semantic network, but should rathe...
A semantic partition based text mining model for document classification
Feature extraction is a mechanism used to extract key phrases from a given text document. This extraction can be weighted, ranked or semantic based. Weighted and ranking-based feature extraction normally assign scores to extracted words based on various heuristics. The highest-scoring words are seen as important. Semantic-based extractions normally try to understand word meanings, and words wit...
Similarity Measures for Text Document Clustering
Clustering is a useful technique that organizes a large quantity of unordered text documents into a small number of meaningful and coherent clusters, thereby providing a basis for intuitive and informative navigation and browsing mechanisms. Partitional clustering algorithms have been recognized to be more suitable than hierarchical clustering schemes for processing large datasets....
Mental Images of Text: Learning Document Similarity using Web Photos
Modern search engines rely solely on text to analyze the content of Web documents. However, it is well known that humans often incorporate "mental visualization" in the form of mental images in order to interpret text. Psychological studies have demonstrated that humans are able to create these visual perceptions even in the absence of external visual stimuli. Such a physiological behavior in the h...
Journal
Journal title: International Journal of Web Science
Year: 2013
ISSN: 1757-8795,1757-8809
DOI: 10.1504/ijws.2013.056572